1.  INTRODUCTION


In a number of situations, it is of interest to find out whether the
addition of a new set of variables gives additional information for inference.
For example, in the area of principal component analysis, it is of interest
to find out whether the new variables contribute to the explanation of the
variation among experimental units. In the area of multi-group discriminant
analysis, it is important to find out whether the addition of new variables
contributes to the discrimination between the groups. Similarly, in the
area of canonical correlation analysis, it is of interest to find out
whether the addition of variables to one or both sets of variables contributes
to the degree of association between the two sets of variables.

Rao (1966) considered the effect of additional variables on the efficiency
of estimates and the power of the test under the multivariate regression model.
Recently, Wijsman (1984) derived the asymptotic distribution of the increase in
the largest sample canonical correlation when some variables are added. In
Section 3 of this paper, we first derive asymptotic distributions of changes
in functions of the eigenvalues of the sample covariance matrix. Asymptotic
distributions of changes in functions of the eigenvalues of the multivariate
analysis of variance (MANOVA) matrix when some variables are added are derived
in Section 4. In Section 5, we derive asymptotic distributions of changes in
certain functions of the sample canonical correlations when new variables are
added to one or both sets of original variables. The above results are derived
under the assumption that the underlying distribution is multivariate normal.
Further results are given in Section 6.


2.  PRELIMINARIES 


In this section, we state the three lemmas which are needed in the sequel.

Lemma 2.1. [Cramer (1946, p.366)]. Let $x_n$ be a p-component random vector
and $\mu$: p x 1 be a fixed vector. Assume $\sqrt{n}\,(x_n - \mu)$ converges to
$N(0, \Sigma)$ in law. Let $f(x)$ be continuously differentiable in a neighborhood
of $\mu$ and let $c = \partial f(x)/\partial x\,|_{x=\mu}$. Then the limiting
distribution of $\sqrt{n}\,\{f(x_n) - f(\mu)\}$ is the same as the one of
$\sqrt{n}\, c'(x_n - \mu)$, which is $N(0, c'\Sigma c)$.
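
As an illustration of Lemma 2.1, the following minimal sketch evaluates the limiting variance $c'\Sigma c$ numerically. The function name `delta_method_variance`, the central-difference gradient, and the example function $f(x) = x_1 x_2$ are assumptions made for illustration only; they are not part of the original report.

```python
import numpy as np

def delta_method_variance(f, mu, sigma, h=1e-6):
    """Return c' Sigma c, where c is a central-difference gradient of f at mu."""
    mu = np.asarray(mu, dtype=float)
    c = np.zeros_like(mu)
    for i in range(mu.size):
        step = np.zeros_like(mu)
        step[i] = h
        # central difference approximation to the i-th partial derivative
        c[i] = (f(mu + step) - f(mu - step)) / (2.0 * h)
    return float(c @ np.asarray(sigma, dtype=float) @ c)

# Hypothetical example: f(x) = x1 * x2 at mu = (1, 2) with Sigma = I.
# The exact gradient is (2, 1), so c' Sigma c = 2^2 + 1^2 = 5.
var = delta_method_variance(lambda x: x[0] * x[1], [1.0, 2.0], np.eye(2))
print(var)
```

In the later sections the same device is applied with $x_n$ replaced by the vector of eigenvalue increases and $c$ by the gradient of $f$ at the population increases.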

Lemma 2.2. Let $nS$ be distributed as a Wishart distribution $W_p(\Sigma, n)$ and $B$
be distributed as a noncentral Wishart distribution $W_p(\Sigma, q: \Omega)$. Assume
$\Omega = O(n) = n\Theta$. Then the limiting distributions of $V = \sqrt{n}\,(S - \Sigma)$ and
$U = \sqrt{n}\,(\tfrac{1}{n}B - \Theta)$ are multivariate normal. Further, the limiting distributions
of $\operatorname{tr} CV$ and $\operatorname{tr} CU$ are $N(0, \sigma_1^2)$ and $N(0, \sigma_2^2)$ respectively, where $C$ is a
symmetric matrix of order p x p, $\sigma_1^2 = 2 \operatorname{tr}(C\Sigma)^2$ and $\sigma_2^2 = 4 \operatorname{tr} C^2\Theta$.

This lemma is well known and is proved by considering the characteristic functions
of V and U.

Lemma 2.3. Let $S_{1,n}$ and $S_{2,n}$ be sequences of symmetric matrices of order
p x p such that the limiting distributions of $V_{1,n} = \sqrt{n}\,(S_{1,n} - \Lambda)$ and
$V_{2,n} = \sqrt{n}\,(S_{2,n} - I_p)$ are multivariate normal, where $\Lambda = \operatorname{diag}(\lambda_1, \ldots, \lambda_p)$,
$\lambda_1 \ge \cdots \ge \lambda_p$ and $I_p$ is the identity matrix of order p x p. Let
$\ell_1 \ge \cdots \ge \ell_p$ and $d_1 \ge \cdots \ge d_p$ be the eigenvalues of $S_{1,n}$ and $S_{1,n}S_{2,n}^{-1}$,
respectively. Suppose that the $\alpha$-th largest eigenvalue $\lambda_\alpha$ of $\Lambda$ is simple.
Then the limiting distributions of $\sqrt{n}\,(\ell_\alpha - \lambda_\alpha)$ and $\sqrt{n}\,(d_\alpha - \lambda_\alpha)$ are the same
as the ones of $(V_{1,n})_{\alpha\alpha}$ and $(V_{1,n})_{\alpha\alpha} - \lambda_\alpha (V_{2,n})_{\alpha\alpha}$ respectively, where $(A)_{\alpha\beta}$
denotes the $(\alpha,\beta)$-th element of a matrix A.

This lemma has been essentially proved in the papers of Hsu (1941a,b) and
Anderson (1963), who treated the general case of multiple roots.
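
The first-order behaviour behind Lemma 2.3 can be checked numerically. The sketch below is only an illustration under assumed inputs: `eps` stands in for $1/\sqrt{n}$, the matrices `V1` and `V2` are arbitrary fixed symmetric perturbations rather than random limits, and all names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
p, eps = 4, 1e-6                      # eps plays the role of 1/sqrt(n)
lam = np.array([4.0, 3.0, 2.0, 1.0])  # simple eigenvalues of Lambda

def sym(m):
    """Symmetrize a square matrix."""
    return (m + m.T) / 2.0

V1 = sym(rng.standard_normal((p, p)))
V2 = sym(rng.standard_normal((p, p)))
S1 = np.diag(lam) + eps * V1          # S_{1,n} = Lambda + V_{1,n}/sqrt(n)
S2 = np.eye(p) + eps * V2             # S_{2,n} = I + V_{2,n}/sqrt(n)

# eigenvalues sorted in decreasing order
ell = np.sort(np.linalg.eigvalsh(S1))[::-1]
d = np.sort(np.linalg.eigvals(S1 @ np.linalg.inv(S2)).real)[::-1]

# compare the scaled deviations with the first-order terms of the lemma
err_ell = np.abs((ell - lam) / eps - np.diag(V1)).max()
err_d = np.abs((d - lam) / eps - (np.diag(V1) - lam * np.diag(V2))).max()
print(err_ell, err_d)
```

Both discrepancies are of order `eps`, which is what the lemma's first-order approximation suggests when the eigenvalues are simple.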


3.  PRINCIPAL  COMPONENT  ANALYSIS 


In the area of principal component analysis, it is of interest to
find a small number of principal components which would adequately
explain the variation among experimental units. In the population, the
variance of the i-th most important principal component is the i-th largest
eigenvalue of the population covariance matrix. If these eigenvalues
are small, then the corresponding principal components are unimportant.

In a number of situations, it is of interest to find out whether
the addition of some variables will increase the variances of the first
few important principal components. Similarly, it is of interest to find
out whether there is a significant increase in the ratio of the i-th
largest eigenvalue to the trace of the covariance matrix if some variables
are added. So, we will derive asymptotic distributions of increases in
certain functions of the eigenvalues of the sample covariance matrix
when a new set of variables is added.

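Let $x$: p x 1 be distributed as $N(0, \Sigma)$. We partition $x = (x_1', x_2')'$,
$x_1$: $p_1$ x 1 and

$$\Sigma = \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{pmatrix}, \qquad \Sigma_{11}: p_1 \times p_1. \tag{3.1}$$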


Suppose the vector $x_1$ is augmented to $x$. Let $\lambda_\alpha$ and $\tilde\lambda_\alpha$ be the $\alpha$-th
largest roots of $\Sigma_{11}$ and $\Sigma$, respectively. Then

$$\tilde\lambda_\alpha \ge \lambda_\alpha \qquad (\alpha = 1, \ldots, p_1)$$

which follows from the Poincare separation theorem (see, e.g., Rao (1973, p.64)).
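
The separation inequality can be illustrated numerically. The following is a minimal sketch; the helper name `eigen_increases` and the 3 x 3 covariance matrix are hypothetical choices for illustration, not taken from the report.

```python
import numpy as np

def eigen_increases(sigma, p1):
    """Return lambda~_alpha - lambda_alpha for alpha = 1, ..., p1."""
    sigma = np.asarray(sigma, dtype=float)
    # roots of the leading p1 x p1 block, in decreasing order
    lam = np.sort(np.linalg.eigvalsh(sigma[:p1, :p1]))[::-1]
    # roots of the full matrix, in decreasing order
    lam_tilde = np.sort(np.linalg.eigvalsh(sigma))[::-1]
    return lam_tilde[:p1] - lam

# Hypothetical covariance matrix with p = 3 and p1 = 2.
sigma = np.array([[4.0, 1.0, 0.5],
                  [1.0, 3.0, 0.2],
                  [0.5, 0.2, 2.0]])
delta = eigen_increases(sigma, p1=2)
print(delta)  # each entry is non-negative
```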

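We are interested in the increases $\delta_\alpha = \tilde\lambda_\alpha - \lambda_\alpha$. Let $S$ be the sample
covariance matrix based on a sample of size $N = n + 1$. We partition $S$ as in (3.1).
The sample quantities corresponding to $\delta_\alpha$ are

$$d_\alpha = \tilde\ell_\alpha - \ell_\alpha, \qquad (\alpha = 1, \ldots, p_1) \tag{3.2}$$

where $\ell_\alpha$ and $\tilde\ell_\alpha$ are the $\alpha$-th largest roots of $S_{11}$ and $S$, respectively.
We consider the distribution of

$$J = \sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\}.$$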


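We assume

A1: $f(d)$ is continuously differentiable in a
neighborhood of $d = \delta$

where $d = (d_1, \ldots, d_{p_1})'$ and $\delta = (\delta_1, \ldots, \delta_{p_1})'$. Let

$$c = (c_1, \ldots, c_{p_1})' = \left.\frac{\partial}{\partial d} f(d)\right|_{d = \delta}. \tag{3.3}$$

Let $H_{11}$: $p_1 \times p_1$ be an orthogonal matrix such that

$$H_{11}\Sigma_{11}H_{11}' = \Lambda_{11} = \operatorname{diag}(\lambda_1, \ldots, \lambda_{p_1}). \tag{3.4}$$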


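Since $\ell_\alpha$ and $\tilde\ell_\alpha$ are invariant under the transformation

$$x \to \begin{pmatrix} H_{11} & 0 \\ 0 & I \end{pmatrix} x,$$

we may assume

$$\Sigma = \Lambda = \begin{pmatrix} \Lambda_{11} & \Lambda_{12} \\ \Lambda_{21} & \Lambda_{22} \end{pmatrix} \tag{3.5}$$

where $\Lambda_{12} = H_{11}\Sigma_{12}$, $\Lambda_{21} = \Sigma_{21}H_{11}'$ and $\Lambda_{22} = \Sigma_{22}$. Let

$$S = \Lambda + \frac{1}{\sqrt{n}}\, V \tag{3.6}$$

$$\tilde S = \Gamma' S \Gamma = \tilde\Lambda + \frac{1}{\sqrt{n}}\, \Gamma' V \Gamma \tag{3.7}$$

where $\Gamma$ is an orthogonal matrix such that $\Gamma'\Lambda\Gamma = \tilde\Lambda = \operatorname{diag}(\tilde\lambda_1, \ldots, \tilde\lambda_p)$.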

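Since $\ell_\alpha$ and $\tilde\ell_\alpha$ are the $\alpha$-th largest roots of $S_{11} = \Lambda_{11} + (1/\sqrt{n})\, V_{11}$ and
$\tilde S = \tilde\Lambda + (1/\sqrt{n})\, \Gamma' V \Gamma$, we obtain by Lemma 2.3 that the asymptotic
distribution of $\sqrt{n}\,(d_\alpha - \delta_\alpha)$ is the same as that of

$$g_\alpha = (\Gamma' V \Gamma)_{\alpha\alpha} - (V)_{\alpha\alpha} \tag{3.8}$$

if $\lambda_\alpha$ and $\tilde\lambda_\alpha$ are simple. Using Lemma 2.1 we obtain that the asymptotic
distribution of $J$ is the same as that of

$$\sum_{\alpha=1}^{p_1} c_\alpha g_\alpha = \operatorname{tr} A V \tag{3.9}$$

where

$$A = \Gamma D_c \Gamma' - D_c, \qquad D_c = \operatorname{diag}(c_1, \ldots, c_{p_1}, 0, \ldots, 0). \tag{3.10}$$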


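This implies the following.

Theorem 3.1. Let $nS$ be distributed as a Wishart distribution
$W_p(\Sigma, n)$. Let $d_\alpha = \tilde\ell_\alpha - \ell_\alpha$ and $\delta_\alpha = \tilde\lambda_\alpha - \lambda_\alpha$, where $\ell_\alpha$, $\tilde\ell_\alpha$, $\lambda_\alpha$ and $\tilde\lambda_\alpha$
are the $\alpha$-th largest roots of $S_{11}$, $S$, $\Sigma_{11}$ and $\Sigma$. Assume a function
$f(d_1, \ldots, d_{p_1})$ satisfies the assumption A1 and all the roots $\lambda_\alpha$ and
$\tilde\lambda_\alpha$ ($\alpha = 1, \ldots, p_1$) are simple. Then

$$\sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\} \xrightarrow{D} N(0, \sigma^2)$$

as $n \to \infty$, where

$$\sigma^2 = 2 \operatorname{tr}(A\Lambda)^2
= 2 \sum_{\alpha=1}^{p_1} c_\alpha^2 (\lambda_\alpha^2 + \tilde\lambda_\alpha^2) - 4 \sum_{\alpha,\beta=1}^{p_1} c_\alpha c_\beta \tilde\lambda_\alpha^2 \gamma_{\beta\alpha}^2$$

and $\Gamma = (\gamma_{\alpha\beta})$.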
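
The two expressions for $\sigma^2$ in Theorem 3.1 can be compared numerically. The sketch below builds the canonical form (3.5), the matrix $\Gamma$ and the matrix $A$ of (3.10) from a given covariance matrix; the function name `pca_variance`, the example matrix and the gradient vector `c` are hypothetical, and the code assumes all relevant roots are simple.

```python
import numpy as np

def pca_variance(sigma, p1, c):
    """Return (2 tr(A Lambda)^2, expanded sum form) of Theorem 3.1."""
    sigma = np.asarray(sigma, dtype=float)
    c = np.asarray(c, dtype=float)
    p = sigma.shape[0]
    # H11 Sigma11 H11' = diag(lambda_1 >= ... >= lambda_p1)
    w, q = np.linalg.eigh(sigma[:p1, :p1])
    order = np.argsort(w)[::-1]
    lam, h11 = w[order], q[:, order].T
    t = np.eye(p)
    t[:p1, :p1] = h11
    big_lam = t @ sigma @ t.T                 # canonical form (3.5)
    # Gamma' Lambda Gamma = diag(lambda~_1 >= ... >= lambda~_p)
    w, g = np.linalg.eigh(big_lam)
    order = np.argsort(w)[::-1]
    lam_t, gamma = w[order], g[:, order]
    d_c = np.zeros((p, p))
    d_c[:p1, :p1] = np.diag(c)
    a = gamma @ d_c @ gamma.T - d_c           # matrix A of (3.10)
    a_lam = a @ big_lam
    trace_form = 2.0 * np.trace(a_lam @ a_lam)
    g2 = gamma[:p1, :p1] ** 2                 # g2[beta, alpha] = gamma_{beta alpha}^2
    cross = (c * lam_t[:p1] ** 2) @ (g2.T @ c)
    expanded = 2.0 * np.sum(c ** 2 * (lam ** 2 + lam_t[:p1] ** 2)) - 4.0 * cross
    return trace_form, expanded

# Hypothetical inputs: p = 3, p1 = 2, gradient c = (1, 0.5).
sigma = np.array([[4.0, 1.0, 0.5],
                  [1.0, 3.0, 0.2],
                  [0.5, 0.2, 2.0]])
trace_form, expanded = pca_variance(sigma, p1=2, c=[1.0, 0.5])
print(trace_form, expanded)
```

Agreement of the two numbers is a consistency check on the formula, not a proof; taking `c` equal to a unit vector gives the variance of Cor. 3.1.1.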

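Cor. 3.1.1. When $\lambda_\alpha$ and $\tilde\lambda_\alpha$ are simple,

$$\sqrt{n}\,\{(\tilde\ell_\alpha - \ell_\alpha) - (\tilde\lambda_\alpha - \lambda_\alpha)\} \xrightarrow{D} N(0, \sigma^2)$$

as $n \to \infty$, where $\sigma^2 = 2(\lambda_\alpha^2 + \tilde\lambda_\alpha^2) - 4 \tilde\lambda_\alpha^2 \gamma_{\alpha\alpha}^2$.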


4.  EFFECT  OF  ADDITIONAL  VARIABLES  IN  DISCRIMINANT  ANALYSIS 


In the area of discriminant analysis, it is of interest to find
out whether the addition of a new set of variables will make a
significant contribution to the discriminant functions. This problem
can be investigated by examining the increases due to the additional
variables in certain functions of the eigenvalues of the MANOVA
matrix. So, we will study the asymptotic distributions of the above
increases in the sample.

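Let $W$ and $B$ be independently distributed as a Wishart distribution
$W_p(\Sigma, n)$ and a noncentral Wishart distribution $W_p(\Sigma, q: \Omega)$.
We partition

$$W = \begin{pmatrix} W_{11} & W_{12} \\ W_{21} & W_{22} \end{pmatrix}, \quad
B = \begin{pmatrix} B_{11} & B_{12} \\ B_{21} & B_{22} \end{pmatrix}, \quad
\Sigma = \begin{pmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{pmatrix}, \quad
\Omega = \begin{pmatrix} \Omega_{11} & \Omega_{12} \\ \Omega_{21} & \Omega_{22} \end{pmatrix} \tag{4.1}$$

with $W_{11}$: $p_1 \times p_1$, $B_{11}$: $p_1 \times p_1$, $\Sigma_{11}$: $p_1 \times p_1$, $\Omega_{11}$: $p_1 \times p_1$. Let $\ell_\alpha$,
$\tilde\ell_\alpha$, $\omega_\alpha$ and $\tilde\omega_\alpha$ be the $\alpha$-th largest roots of $B_{11}W_{11}^{-1}$, $BW^{-1}$, $\Omega_{11}\Sigma_{11}^{-1}$ and
$\Omega\Sigma^{-1}$, respectively. Then

$$d_\alpha = \tilde\ell_\alpha - \ell_\alpha \ge 0, \qquad n\delta_\alpha = \tilde\omega_\alpha - \omega_\alpha \ge 0, \qquad (\alpha = 1, \ldots, p_1) \tag{4.2}$$

which follows from the Poincare separation theorem in the case of two
matrices (also see Gabriel (1968)).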
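
The inequality $\tilde\ell_\alpha \ge \ell_\alpha$ for the roots of $BW^{-1}$ can be illustrated as follows. This is a sketch under assumed inputs: the helper `manova_roots` and the simulated matrices `w` and `b` are hypothetical, and a central $B$ is used only because the inequality itself does not depend on noncentrality.

```python
import numpy as np

def manova_roots(b, w):
    """Roots of |B - l W| = 0 (eigenvalues of B W^{-1}), in decreasing order."""
    chol = np.linalg.cholesky(w)                 # W = L L'
    half = np.linalg.solve(chol, b)              # L^{-1} B
    m = np.linalg.solve(chol, half.T)            # L^{-1} B L^{-T}, symmetric
    return np.sort(np.linalg.eigvalsh(m))[::-1]

rng = np.random.default_rng(2)
p, p1 = 3, 2
x = rng.standard_normal((30, p))
y = rng.standard_normal((5, p))
w = x.T @ x                                      # plays the role of W
b = y.T @ y                                      # plays the role of B

ell = manova_roots(b[:p1, :p1], w[:p1, :p1])     # roots of B11 W11^{-1}
ell_tilde = manova_roots(b, w)                   # roots of B W^{-1}
d = ell_tilde[:p1] - ell
print(d)  # each entry is non-negative
```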


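Let

$$T = \begin{pmatrix} H_{11}L_{11} & 0 \\ L_{21} & L_{22} \end{pmatrix}, \qquad
L = \begin{pmatrix} L_{11} & 0 \\ L_{21} & L_{22} \end{pmatrix} \tag{4.3}$$

where $L\Sigma L' = I$ and $H_{11}$ is an orthogonal matrix such that
$H_{11}L_{11}\Omega_{11}L_{11}'H_{11}' = \operatorname{diag}(\omega_1, \ldots, \omega_{p_1})$. Since $\ell_\alpha$ and $\tilde\ell_\alpha$ are invariant
under the transformation $B \to TBT'$ and $W \to TWT'$, we may assume

$$\Sigma = I, \qquad \Omega = \begin{pmatrix} \Omega_{11} & \Omega_{12} \\ \Omega_{21} & \Omega_{22} \end{pmatrix} \tag{4.4}$$

with $\Omega_{11} = \operatorname{diag}(\omega_1, \ldots, \omega_{p_1})$, $\Omega_{12} = H_{11}L_{11}(\Omega_{11}L_{21}' + \Omega_{12}L_{22}')$, $\Omega_{21} = \Omega_{12}'$
and $\Omega_{22} = (L_{21}\Omega_{11} + L_{22}\Omega_{21})L_{21}' + (L_{21}\Omega_{12} + L_{22}\Omega_{22})L_{22}'$, where the $\Omega_{ij}$ on the
right-hand sides denote the blocks of the original $\Omega$. We assume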

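A2: $\Omega = O(n)$,

$$\Omega = n\Theta = n \begin{pmatrix} \Theta_{11} & \Theta_{12} \\ \Theta_{21} & \Theta_{22} \end{pmatrix} \tag{4.5}$$

where $\Theta_{11} = \operatorname{diag}(\theta_1, \ldots, \theta_{p_1})$. Let $\Gamma$ be an orthogonal matrix such
that

$$\Gamma'\Theta\Gamma = \tilde\Theta = \operatorname{diag}(\tilde\theta_1, \ldots, \tilde\theta_p) \tag{4.6}$$

with $\tilde\theta_1 \ge \cdots \ge \tilde\theta_p$. Then $n\tilde\theta_\alpha = \tilde\omega_\alpha$. Let

$$\frac{1}{n} B = \Theta + \frac{1}{\sqrt{n}}\, U, \qquad \frac{1}{n} W = I + \frac{1}{\sqrt{n}}\, V. \tag{4.7}$$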


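Then it is easily seen that $\ell_\alpha$ and $\tilde\ell_\alpha$ are the $\alpha$-th largest roots of

$$\left|\Theta_{11} + \frac{1}{\sqrt{n}}\, U_{11} - \ell\left(I + \frac{1}{\sqrt{n}}\, V_{11}\right)\right| = 0$$

and

$$\left|\tilde\Theta + \frac{1}{\sqrt{n}}\, \Gamma' U \Gamma - \ell\left(I + \frac{1}{\sqrt{n}}\, \Gamma' V \Gamma\right)\right| = 0$$

respectively, where $U_{11}$: $p_1 \times p_1$ and $V_{11}$: $p_1 \times p_1$ are the submatrices of
$U$ and $V$ partitioned as in (4.1).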


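From Lemma 2.3, it is seen that the asymptotic distribution of $\sqrt{n}\,(d_\alpha - \delta_\alpha)$
is the same as that of

$$g_\alpha = (\Gamma' U \Gamma)_{\alpha\alpha} - \tilde\theta_\alpha (\Gamma' V \Gamma)_{\alpha\alpha} - (U)_{\alpha\alpha} + \theta_\alpha (V)_{\alpha\alpha} \tag{4.8}$$

if $\theta_\alpha$ and $\tilde\theta_\alpha$ are simple. Using Lemma 2.1, we obtain that the asymptotic
distribution of $\sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\}$ is the same as that of

$$\sum_{\alpha=1}^{p_1} c_\alpha g_\alpha = \operatorname{tr} A^{(1)} V + \operatorname{tr} A^{(2)} U$$

where

$$A^{(1)} = D_{c\theta} - \Gamma D_{c\tilde\theta}\Gamma', \qquad A^{(2)} = \Gamma D_c \Gamma' - D_c, \qquad D_c = \operatorname{diag}(c_1, \ldots, c_{p_1}, 0, \ldots, 0), \tag{4.9}$$

$D_{c\theta} = \operatorname{diag}(c_1\theta_1, \ldots, c_{p_1}\theta_{p_1}, 0, \ldots, 0)$, etc. This implies the following.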


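Theorem 4.1. Let $W$ and $B$ be independently distributed as a
Wishart distribution $W_p(\Sigma, n)$ and a noncentral Wishart distribution
$W_p(\Sigma, q: \Omega)$, respectively. Let $d_\alpha = \tilde\ell_\alpha - \ell_\alpha$ and $n\delta_\alpha = \tilde\omega_\alpha - \omega_\alpha$, where
$\ell_\alpha$, $\tilde\ell_\alpha$, $\omega_\alpha$ and $\tilde\omega_\alpha$ are the $\alpha$-th largest roots of $B_{11}W_{11}^{-1}$, $BW^{-1}$, $\Omega_{11}\Sigma_{11}^{-1}$
and $\Omega\Sigma^{-1}$, respectively. Assume that the assumptions A1 and A2 are
satisfied, and $\theta_\alpha$ and $\tilde\theta_\alpha$ ($\alpha = 1, \ldots, p_1$) are simple. Then

$$\sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\} \xrightarrow{D} N(0, \sigma^2)$$

as $n \to \infty$, where

$$\sigma^2 = 2 \operatorname{tr}\{D_{c\theta} - \Gamma D_{c\tilde\theta}\Gamma'\}^2 + 4 \operatorname{tr}\{\Gamma D_c \Gamma' - D_c\}^2 \Theta$$

$$= 2 \sum_{\alpha=1}^{p_1} c_\alpha^2 \{\theta_\alpha(\theta_\alpha + 2) + \tilde\theta_\alpha(\tilde\theta_\alpha + 2)\}
- 4 \sum_{\alpha,\beta=1}^{p_1} c_\alpha c_\beta \gamma_{\beta\alpha}^2\, \tilde\theta_\alpha(\theta_\beta + 2)$$

and $\Gamma = (\gamma_{\alpha\beta})$.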
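
The trace form and the expanded form of the limiting variance in Theorem 4.1 can be compared numerically in the canonical setting $\Sigma = I$. The sketch below is an illustration only: the function name `manova_variance`, the simulated matrix `theta` and the gradient `c` are hypothetical, and simple roots are assumed.

```python
import numpy as np

def manova_variance(theta, p1, c):
    """Return (trace form, expanded form) of sigma^2 in Theorem 4.1 with Sigma = I."""
    theta = np.asarray(theta, dtype=float)
    c = np.asarray(c, dtype=float)
    p = theta.shape[0]
    # rotate so that Theta11 = diag(theta_1 >= ... >= theta_p1)
    w, q = np.linalg.eigh(theta[:p1, :p1])
    order = np.argsort(w)[::-1]
    th, h11 = w[order], q[:, order].T
    t = np.eye(p)
    t[:p1, :p1] = h11
    th_mat = t @ theta @ t.T
    # Gamma' Theta Gamma = diag(theta~_1 >= ... >= theta~_p)
    w, g = np.linalg.eigh(th_mat)
    order = np.argsort(w)[::-1]
    th_t, gamma = w[order], g[:, order]

    def padded_diag(v):
        m = np.zeros((p, p))
        m[:p1, :p1] = np.diag(v)
        return m

    d_c = padded_diag(c)
    a1 = padded_diag(c * th) - gamma @ padded_diag(c * th_t[:p1]) @ gamma.T
    a2 = gamma @ d_c @ gamma.T - d_c
    trace_form = 2.0 * np.trace(a1 @ a1) + 4.0 * np.trace(a2 @ a2 @ th_mat)
    g2 = gamma[:p1, :p1] ** 2                    # g2[beta, alpha] = gamma_{beta alpha}^2
    cross = (c * th_t[:p1]) @ (g2.T @ (c * (th + 2.0)))
    diag_part = np.sum(c ** 2 * (th * (th + 2.0) + th_t[:p1] * (th_t[:p1] + 2.0)))
    return trace_form, 2.0 * diag_part - 4.0 * cross

rng = np.random.default_rng(3)
m = rng.standard_normal((4, 3))
theta = m.T @ m                                  # hypothetical Theta, p = 3
trace_form, expanded = manova_variance(theta, p1=2, c=[1.0, -0.5])
print(trace_form, expanded)
```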
5.  EFFECT  OF  ADDITIONAL  VARIABLES  ON  CANONICAL  VARIABLES


In  this  section,  we  study  asymptotic  distributions  of  certain 
statistics  useful  in  studying  the  effect  of  additional  variables  on 
the  canonical  correlations. 

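Consider two sets of variables $x_1$: $p_1 \times 1$ and $y_1$: $q_1 \times 1$. We assume
$p_1 \le q_1$. Let $\rho_1 \ge \cdots \ge \rho_{p_1} \ge 0$ be the canonical correlations between
$x_1$ and $y_1$. We shall augment the variates $x_1$ and $y_1$ to $x$: $p \times 1$ and
$y$: $q \times 1$ by adding extra variates $x_2$: $p_2 \times 1$ and $y_2$: $q_2 \times 1$, respectively.
We assume that $(x', y')'$ is distributed as $N_{p+q}[\mu, \Sigma]$. We partition $\Sigma$ as

$$\Sigma = \begin{pmatrix} \Sigma_{xx} & \Sigma_{xy} \\ \Sigma_{yx} & \Sigma_{yy} \end{pmatrix}, \qquad \Sigma_{xx}: p \times p. \tag{5.1}$$

Let $\tilde\rho_\alpha$ be the $\alpha$-th largest canonical correlation between $x$ and $y$.
Then

$$\delta_\alpha = \tilde\rho_\alpha - \rho_\alpha \ge 0 \qquad (\alpha = 1, \ldots, p_1)$$

which has been shown in Fujikoshi (1982).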
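
The non-negativity of the increases in the canonical correlations can be illustrated numerically. In the sketch below the canonical correlations are computed as singular values of a whitened cross-covariance block; the helper `canonical_correlations`, the index sets and the simulated covariance matrix are hypothetical choices for illustration.

```python
import numpy as np

def canonical_correlations(sigma, x_idx, y_idx):
    """Canonical correlations between the x- and y-blocks of sigma, decreasing."""
    sxx = sigma[np.ix_(x_idx, x_idx)]
    syy = sigma[np.ix_(y_idx, y_idx)]
    sxy = sigma[np.ix_(x_idx, y_idx)]
    lx = np.linalg.cholesky(sxx)                 # Sxx = Lx Lx'
    ly = np.linalg.cholesky(syy)                 # Syy = Ly Ly'
    a = np.linalg.solve(lx, sxy)                 # Lx^{-1} Sxy
    m = np.linalg.solve(ly, a.T).T               # Lx^{-1} Sxy Ly^{-T}
    return np.linalg.svd(m, compute_uv=False)

rng = np.random.default_rng(4)
z = rng.standard_normal((50, 6))
sigma = z.T @ z / 50.0                           # hypothetical covariance, p = q = 3

p1 = 2
rho = canonical_correlations(sigma, [0, 1], [3, 4])               # x1 with y1
rho_tilde = canonical_correlations(sigma, [0, 1, 2], [3, 4, 5])   # x with y
delta = rho_tilde[:p1] - rho
print(delta)  # each entry is non-negative
```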

Let

$$S = \begin{pmatrix} S_{xx} & S_{xy} \\ S_{yx} & S_{yy} \end{pmatrix} \tag{5.2}$$

be the sample covariance matrix based on a sample of size $N = n + 1$.
Let $r_\alpha$ and $\tilde r_\alpha$ be the $\alpha$-th largest canonical correlations between $x_1$
and $y_1$ and between $x$ and $y$. We consider the asymptotic distribution of

$$\sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\}$$

where $d_\alpha = \tilde r_\alpha - r_\alpha$. Let $L_1$ and $L_2$ be the lower triangular matrices such that

$$L_1 \Sigma_{xx} L_1' = I_p, \qquad L_2 \Sigma_{yy} L_2' = I_q. \tag{5.3}$$

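We partition

$$L_1 = \begin{pmatrix} L_{1\cdot 11} & 0 \\ L_{1\cdot 21} & L_{1\cdot 22} \end{pmatrix}, \qquad
L_2 = \begin{pmatrix} L_{2\cdot 11} & 0 \\ L_{2\cdot 21} & L_{2\cdot 22} \end{pmatrix}. \tag{5.4}$$

Let $H_{1\cdot 11}$: $p_1 \times p_1$ and $H_{2\cdot 11}$: $q_1 \times q_1$ be the orthogonal matrices
chosen so that

$$H_{1\cdot 11} L_{1\cdot 11} \Sigma_{xy\cdot 11} L_{2\cdot 11}' H_{2\cdot 11}' = P_{11} \tag{5.5}$$

where

$$\Sigma_{xy} = \begin{pmatrix} \Sigma_{xy\cdot 11} & \Sigma_{xy\cdot 12} \\ \Sigma_{xy\cdot 21} & \Sigma_{xy\cdot 22} \end{pmatrix}, \qquad \Sigma_{xy\cdot 11}: p_1 \times q_1. \tag{5.6}$$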

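Then $r_\alpha$ and $\tilde r_\alpha$ are invariant under the transformation

$$x \to \begin{pmatrix} H_{1\cdot 11}L_{1\cdot 11} & 0 \\ L_{1\cdot 21} & L_{1\cdot 22} \end{pmatrix} x, \qquad
y \to \begin{pmatrix} H_{2\cdot 11}L_{2\cdot 11} & 0 \\ L_{2\cdot 21} & L_{2\cdot 22} \end{pmatrix} y.$$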

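Therefore, we may assume

$$\Sigma_{xx} = I_p, \qquad \Sigma_{yy} = I_q, \qquad
\Sigma_{xy} = P = \begin{pmatrix} P_{11} & P_{12} \\ P_{21} & P_{22} \end{pmatrix} \tag{5.7}$$

with $P_{11}$ in (5.5),
$P_{12} = H_{1\cdot 11}L_{1\cdot 11}(\Sigma_{xy\cdot 11}L_{2\cdot 21}' + \Sigma_{xy\cdot 12}L_{2\cdot 22}')$,
$P_{21} = (L_{1\cdot 21}\Sigma_{xy\cdot 11} + L_{1\cdot 22}\Sigma_{xy\cdot 21})L_{2\cdot 11}'H_{2\cdot 11}'$, and
$P_{22} = (L_{1\cdot 21}\Sigma_{xy\cdot 11} + L_{1\cdot 22}\Sigma_{xy\cdot 21})L_{2\cdot 21}' + (L_{1\cdot 21}\Sigma_{xy\cdot 12} + L_{1\cdot 22}\Sigma_{xy\cdot 22})L_{2\cdot 22}'$. Let

$$S = \Sigma + \frac{1}{\sqrt{n}}\, V
= \begin{pmatrix} I & P \\ P' & I \end{pmatrix} + \frac{1}{\sqrt{n}} \begin{pmatrix} V_{xx} & V_{xy} \\ V_{yx} & V_{yy} \end{pmatrix}. \tag{5.8}$$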


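Then $r_\alpha$ is the $\alpha$-th root of

$$\left| S_{xy\cdot 11} S_{yy\cdot 11}^{-1} S_{yx\cdot 11} - r^2 S_{xx\cdot 11} \right| = 0$$

which is equivalent to

$$\left| P_{11}P_{11}' + \frac{1}{\sqrt{n}}\, Z_{11} - r^2\left(I + \frac{1}{\sqrt{n}}\, V_{xx\cdot 11}\right) \right| = 0 \tag{5.9}$$

where

$$Z_{11} = \sqrt{n}\left\{\left(P_{11} + \frac{1}{\sqrt{n}} V_{xy\cdot 11}\right)\left(I + \frac{1}{\sqrt{n}} V_{yy\cdot 11}\right)^{-1}\left(P_{11}' + \frac{1}{\sqrt{n}} V_{yx\cdot 11}\right) - P_{11}P_{11}'\right\}, \tag{5.10}$$

and $S_{xy\cdot 11}$, $V_{xy\cdot 11}$, etc. denote the submatrices of $S$ and $V$ partitioned as
in (5.6). Let $\Gamma_1$ and $\Gamma_2$ be the orthogonal matrices such that

$$\Gamma_1 P \Gamma_2' = \tilde P: p \times q \tag{5.11}$$

where the $(\alpha, \alpha)$ elements of $\tilde P$ are $\tilde\rho_\alpha$ and other elements are zero. Let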


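$$\begin{pmatrix} \Gamma_1 & 0 \\ 0 & \Gamma_2 \end{pmatrix} S \begin{pmatrix} \Gamma_1 & 0 \\ 0 & \Gamma_2 \end{pmatrix}'
= \begin{pmatrix} \Gamma_1 S_{xx} \Gamma_1' & \Gamma_1 S_{xy} \Gamma_2' \\ \Gamma_2 S_{yx} \Gamma_1' & \Gamma_2 S_{yy} \Gamma_2' \end{pmatrix}
= \begin{pmatrix} I & \tilde P \\ \tilde P' & I \end{pmatrix}
+ \frac{1}{\sqrt{n}} \begin{pmatrix} \Gamma_1 V_{xx} \Gamma_1' & \Gamma_1 V_{xy} \Gamma_2' \\ \Gamma_2 V_{yx} \Gamma_1' & \Gamma_2 V_{yy} \Gamma_2' \end{pmatrix}. \tag{5.12}$$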


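From (5.12) we obtain that $\tilde r_\alpha$ is the $\alpha$-th largest root of

$$\left| \tilde P \tilde P' + \frac{1}{\sqrt{n}}\, \tilde Z - r^2\left(I + \frac{1}{\sqrt{n}}\, \Gamma_1 V_{xx} \Gamma_1'\right) \right| = 0, \tag{5.13}$$

where

$$\tilde Z = \sqrt{n}\left\{\left(\tilde P + \frac{1}{\sqrt{n}} \Gamma_1 V_{xy} \Gamma_2'\right)\left(I + \frac{1}{\sqrt{n}} \Gamma_2 V_{yy} \Gamma_2'\right)^{-1}\left(\tilde P' + \frac{1}{\sqrt{n}} \Gamma_2 V_{yx} \Gamma_1'\right) - \tilde P \tilde P'\right\}. \tag{5.14}$$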

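We note that the limiting distributions of $Z_{11}$ and $\tilde Z$ are the same as those
of $V_{xy\cdot 11}P_{11}' + P_{11}V_{yx\cdot 11} - P_{11}V_{yy\cdot 11}P_{11}'$ and
$\Gamma_1 V_{xy} \Gamma_2' \tilde P' + \tilde P \Gamma_2 V_{yx} \Gamma_1' - \tilde P \Gamma_2 V_{yy} \Gamma_2' \tilde P'$,
respectively. Applying Lemma 2.3 to (5.9) and (5.14) we obtain
that the asymptotic distribution of $\sqrt{n}\,(d_\alpha - \delta_\alpha)$ is the same as that of

$$g_\alpha = (\Gamma_1 V_{xy} \Gamma_2')_{\alpha\alpha} - (V_{xy})_{\alpha\alpha}
- \tfrac{1}{2}\tilde\rho_\alpha (\Gamma_1 V_{xx} \Gamma_1')_{\alpha\alpha} - \tfrac{1}{2}\tilde\rho_\alpha (\Gamma_2 V_{yy} \Gamma_2')_{\alpha\alpha}
+ \tfrac{1}{2}\rho_\alpha (V_{xx})_{\alpha\alpha} + \tfrac{1}{2}\rho_\alpha (V_{yy})_{\alpha\alpha}. \tag{5.15}$$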


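Therefore the asymptotic distribution of $\sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\}$
is the same as that of

$$\sum_{\alpha=1}^{p_1} c_\alpha g_\alpha = \tfrac{1}{2} \operatorname{tr} A V \tag{5.16}$$

where

$$A = \begin{pmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{pmatrix},$$

$$A_{11} = \operatorname{diag}(c_1\rho_1, \ldots, c_{p_1}\rho_{p_1}, 0, \ldots, 0) - \Gamma_1' \operatorname{diag}(c_1\tilde\rho_1, \ldots, c_{p_1}\tilde\rho_{p_1}, 0, \ldots, 0)\, \Gamma_1,$$

$$A_{22} = \operatorname{diag}(c_1\rho_1, \ldots, c_{p_1}\rho_{p_1}, 0, \ldots, 0) - \Gamma_2' \operatorname{diag}(c_1\tilde\rho_1, \ldots, c_{p_1}\tilde\rho_{p_1}, 0, \ldots, 0)\, \Gamma_2,$$

$$A_{21} = A_{12}' = \Gamma_2' D_c \Gamma_1 - D_c,$$

and the $(\alpha, \alpha)$ elements of $D_c$: $q \times p$ are $c_\alpha$ for $\alpha = 1, \ldots, p_1$ and the other
elements of $D_c$ are zero. This implies the following.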


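Theorem 5.1. Let $r_\alpha$ and $\tilde r_\alpha$ be the $\alpha$-th largest canonical
correlations between $x_1$: $p_1 \times 1$ and $y_1$: $q_1 \times 1$ ($p_1 \le q_1$) and between
$x$: $p \times 1$ and $y$: $q \times 1$, based on a sample of size $N = n + 1$ from $N(\mu, \Sigma)$.
Let $\rho_\alpha$ and $\tilde\rho_\alpha$ be the corresponding population quantities. Assume that
a function $f(d_1, \ldots, d_{p_1})$ satisfies the assumption A1 and the canonical
correlations $\rho_\alpha$ and $\tilde\rho_\alpha$ ($\alpha = 1, \ldots, p_1$) are simple. Then

$$\sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\} \xrightarrow{D} N(0, \sigma^2)$$

as $n \to \infty$, where $d_\alpha = \tilde r_\alpha - r_\alpha$, $\delta_\alpha = \tilde\rho_\alpha - \rho_\alpha$,

$$\sigma^2 = \tfrac{1}{2} \operatorname{tr}\left\{ \begin{pmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{pmatrix} \begin{pmatrix} I & P \\ P' & I \end{pmatrix} \right\}^2$$

$$= \sum_{\alpha=1}^{p_1} c_\alpha^2 \{(1 - \rho_\alpha^2)^2 + (1 - \tilde\rho_\alpha^2)^2\}
+ \sum_{\alpha,\beta=1}^{p_1} c_\alpha c_\beta (1 - \tilde\rho_\beta^2) \{\rho_\alpha \tilde\rho_\beta (\gamma_{1\cdot\beta\alpha}^2 + \gamma_{2\cdot\beta\alpha}^2) - 2\gamma_{1\cdot\beta\alpha}\gamma_{2\cdot\beta\alpha}\}$$

and $\Gamma_1 = (\gamma_{1\cdot\alpha\beta})$, $\Gamma_2 = (\gamma_{2\cdot\alpha\beta})$.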

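Cor. 5.1.1. When $\rho_\alpha$ and $\tilde\rho_\alpha$ are simple,

$$\sqrt{n}\,\{(\tilde r_\alpha - r_\alpha) - (\tilde\rho_\alpha - \rho_\alpha)\} \xrightarrow{D} N(0, \sigma^2)$$

as $n \to \infty$, where

$$\sigma^2 = (1 - \rho_\alpha^2)^2 + (1 - \tilde\rho_\alpha^2)^2 + (1 - \tilde\rho_\alpha^2)\{\rho_\alpha\tilde\rho_\alpha(\gamma_{1\cdot\alpha\alpha}^2 + \gamma_{2\cdot\alpha\alpha}^2) - 2\gamma_{1\cdot\alpha\alpha}\gamma_{2\cdot\alpha\alpha}\}.$$

Note: Wijsman (1984) proved Cor. 5.1.1 in the case of $\alpha = 1$ and $y = y_1$.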


6.  FURTHER  RESULTS 

Let $d_\alpha$ and $\delta_\alpha$ be the increases in the $\alpha$-th largest sample and
population eigenvalues in the three cases: (i) principal component analysis,
(ii) discriminant analysis, and (iii) canonical correlation analysis. Then
in Theorems 3.1, 4.1, and 5.1, we have shown that

$$J = \sqrt{n}\,\{f(d_1, \ldots, d_{p_1}) - f(\delta_1, \ldots, \delta_{p_1})\} \xrightarrow{D} N(0, \sigma^2). \tag{6.1}$$

The limiting variance $\sigma^2$ depends on unknown $\Sigma$ for cases (i) and (iii)
and unknown $\Sigma$ and $\Theta$ for case (ii). In order to make the formula useful
we need an estimate $\hat\sigma^2$ of $\sigma^2$, obtained from $\sigma^2$ by replacing $\Sigma$ by $S$ for (i)
and (iii), and by replacing $\Sigma$ and $\Theta$ by $(1/n)W$ and $(1/n)B$, respectively,
for (ii). It is easy to see that under the same assumption as in each of
the Theorems

$$\hat\sigma \to \sigma \quad \text{in probability.}$$

Therefore, it follows from (6.1) that

$$\hat\sigma^{-1} J \xrightarrow{D} N(0, 1). \tag{6.2}$$

The formula is useful in constructing an approximate confidence interval for
$f(\delta_1, \ldots, \delta_{p_1})$.
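
As a sketch of how (6.2) might be used in case (i), the code below forms an approximate confidence interval for a single increase $\delta_\alpha$, plugging $S$ into the variance of Cor. 3.1.1. The function name `pca_increase_ci`, the normal quantile 1.96, the simulated data and the mixing matrix are all assumptions made for illustration; the report itself does not prescribe an implementation.

```python
import numpy as np

def pca_increase_ci(x, p1, alpha_idx=0, z=1.96):
    """Approximate CI for delta_alpha (PCA case) from an N x p data matrix x."""
    n = x.shape[0] - 1
    s = np.cov(x, rowvar=False)                  # sample covariance, divisor n
    p = s.shape[0]
    # rotate so that S11 is diagonal with decreasing roots
    w, q = np.linalg.eigh(s[:p1, :p1])
    order = np.argsort(w)[::-1]
    ell, h11 = w[order], q[:, order].T
    t = np.eye(p)
    t[:p1, :p1] = h11
    s_c = t @ s @ t.T
    # plug-in Gamma and roots of the full matrix
    w, g = np.linalg.eigh(s_c)
    order = np.argsort(w)[::-1]
    ell_t, gamma = w[order], g[:, order]
    a = alpha_idx
    d = ell_t[a] - ell[a]
    # plug-in version of 2(lambda^2 + lambda~^2) - 4 lambda~^2 gamma_aa^2
    var = 2.0 * (ell[a] ** 2 + ell_t[a] ** 2) - 4.0 * ell_t[a] ** 2 * gamma[a, a] ** 2
    half = z * np.sqrt(max(var, 0.0) / n)
    return d, d - half, d + half

rng = np.random.default_rng(1)
mix = np.array([[2.0, 0.5, 0.3],
                [0.0, 1.5, 0.4],
                [0.0, 0.0, 1.0]])
x = rng.standard_normal((200, 3)) @ mix          # hypothetical sample, p = 3
d, lo, hi = pca_increase_ci(x, p1=2, alpha_idx=0)
print(d, lo, hi)
```

Since $\delta_\alpha \ge 0$ by the separation theorem, a lower limit below zero could reasonably be truncated at zero; that refinement is a design choice outside what the report states.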

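It is easy to extend the result for a single function $J$ to the one
for several functions. Let

$$J_\alpha = \sqrt{n}\,\{f_\alpha(d_1, \ldots, d_{p_1}) - f_\alpha(\delta_1, \ldots, \delta_{p_1})\}$$

for $\alpha = 1, 2, \ldots, k$. We assume that the $f_\alpha$'s satisfy the assumption A1. Let

$$c_\alpha = (c_{1,\alpha}, \ldots, c_{p_1,\alpha})' = \left.\frac{\partial}{\partial d} f_\alpha(d)\right|_{d = \delta}.$$

Then we can prove that

$$J = (J_1, \ldots, J_k)' \xrightarrow{D} N_k(0, Q).$$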


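The limiting covariance matrix $Q = (q_{\alpha\beta})$ is given as follows:

Case (i):

$$q_{\alpha\beta} = 2 \operatorname{tr}(A_\alpha \Lambda A_\beta \Lambda),$$

where $A_\alpha$ is defined from $A$ in (3.10) by substituting $c_\alpha$ into $c$.

Case (ii):

$$q_{\alpha\beta} = 2 \operatorname{tr} A_\alpha^{(1)} A_\beta^{(1)} + 4 \operatorname{tr} A_\alpha^{(2)} A_\beta^{(2)} \Theta,$$

where $A_\alpha^{(1)}$ and $A_\alpha^{(2)}$ are defined from the $A^{(1)}$ and $A^{(2)}$ in (4.9) by
substituting $c_\alpha$ into $c$.

Case (iii):

$$q_{\alpha\beta} = \tfrac{1}{2} \operatorname{tr} A_\alpha \begin{pmatrix} I & P \\ P' & I \end{pmatrix} A_\beta \begin{pmatrix} I & P \\ P' & I \end{pmatrix},$$

where $A_\alpha$ is defined from $A$ in (5.14) by substituting $c_\alpha$ into $c$.

Higher order terms of the joint distribution of $J_1, \ldots, J_k$ can be obtained
by using the perturbation technique.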




REFERENCES 


[1] Anderson, T.W. (1963). Asymptotic theory for principal component
    analysis. Ann. Math. Statist. 34, 122-148.

[2] Cramer, H. (1946). Mathematical Methods of Statistics. Princeton
    University Press.

[3] Fujikoshi, Y. (1982). A test for additional information in canonical
    correlation analysis. Ann. Inst. Statist. Math. 34, 523-530.

[4] Gabriel, K.R. (1968). Simultaneous test procedures in multivariate
    analysis of variance. Biometrika 55, 484-504.

[5] Hsu, P.L. (1941a). On the limiting distribution of roots of a
    determinantal equation. J. London Math. Soc. 16, 183-194.

[6] Hsu, P.L. (1941b). On the limiting distribution of the canonical
    correlations. Biometrika 32, 38-45.

[7] Rao, C.R. (1966). Covariance adjustment and related problems in
    multivariate analysis. In Multivariate Analysis (P.R. Krishnaiah,
    editor), 87-103. Academic Press, New York.

[8] Rao, C.R. (1973). Linear Statistical Inference and Its Applications.
    New York, John Wiley & Sons.

[9] Wijsman, R.A. (1984). Asymptotic distribution of the increase of the
    largest canonical correlation when one of the vectors is augmented.
    To appear in J. Multivariate Anal.


REPORT DOCUMENTATION PAGE

Title: Effect of Additional Variables in Principal Component Analysis,
    Discriminant Analysis and Canonical Correlation Analysis
Type of report and period covered: Technical - August, 1985
Performing org. report number: 85-31
Authors: Y. Fujikoshi, P.R. Krishnaiah, J. Schmidhammer
Contract or grant number: F49620-85-C-0008
Performing organization: Center for Multivariate Analysis,
    515 Thackeray Hall, University of Pittsburgh,
    Pittsburgh, Pennsylvania 15260
Controlling office: Air Force Office of Scientific Research,
    Department of the Air Force, Bolling Air Force Base, DC 20332
Report date: August 1985
Number of pages: 19
Security class. (of this report): Unclassified
Distribution statement: Approved for public release; distribution unlimited

Key words: Asymptotic distribution theory, Canonical correlation analysis,
    Discriminant analysis, Principal component analysis.

Abstract: In this paper, the authors derived asymptotic distributions of changes in
certain functions of the eigenvalues of the sample covariance matrix, MANOVA
matrix and canonical correlation matrix when some variables are added to the
original sets of variables. The above results are useful in finding out
whether the new variables give additional information.


